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MARK EMBEDDING AND DETECTION USING PROJECTIVE TRANSFORMS 

Cross-reference to Related Patent Applications 

This application claims the right of priority under 35 U.S.C § 119 based on 
5 Australian Patent Application No. 2002951815, filed on October 03 , 2002, which is 
incorporated by reference herein in its entirety as if fully set forth herein. 

Field of the Invention 

The present invention relates generally to the embedding and detection of 
watermarks in an image and, in particular, to the embedding and detection of patterns having 
10 variation in one direction only. 

Background 

In the last decade, numerous schemes have been proposed for the imperceptible 

embedding and detection of marks in signals. For a mark to be imperceptible in a signal, such 

as an image or even a video, the mark is typically embedded at a very low level when 
15 compared with the signal. For such a low-level signal to be detectable, a highly efficient 

detection process is needed. Such a process exists in the form of correlation or "matched 

filtering" in the case where the process is an optimal linear detection. 

Certain marks have near perfect correlation properties, allowing virtually all the 

energy of the mark to be concentrated into a single detection point. Unfortunately most 
20 correlation processes are effectively destroyed, or at least severely impeded, by distortions of 

the marked signal. In two dimensions, common distortions include rotation, scaling, 

anamorphic scaling and shear. 

Summary 

It is an object of the present invention to substantially overcome, or at least 
25 ameliorate, one or more disadvantages of existing arrangements. 
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Disclosed are arrangements which seek to detect an embedded watermark, the 
watermark being formed from a one-dimensional function, even when the image including the 
watermark has been rotated, by applying a projective transform on the image, followed by 
correlation with the basis function. 
5 According to an aspect of the present invention there is provided a method of 

detecting one or more patterns embedded in an image, each pattern having been formed 
substantially from a one-dimensional basis function, said method comprising the steps of: 

(a) calculating a projective transform of said image; 

(b) calculating a 1-D correlation between the projective transform and said 
10 basis function for a selection of angles; and 

(c) finding one or more peaks of said correlation, wherein the position of each 
of said peaks provides spatial parameters of one of said one or more embedded patterns. 

Another aspect of the present invention provides a method of determining 
transformations applied to an image, said image having at least three patterns embedded 
15 therein, each pattern having been formed substantially from a one-dimensional basis function 
being substantially scale invariant as herein defined, said method comprising the steps of: 

(a) calculating a projective transform of said image; 

(b) calculating a 1-D correlation between the projective transform and said 
basis function for a selection of angles; 

20 (c) finding peaks of said correlation, wherein the positions of said peaks 

provide spatial parameters of said embedded patterns; and 

(d) determining from said spatial parameters said transformations. 

Another aspect of the present invention provides a method of detecting one or 
more patterns embedded in an image, each pattern having been formed substantially from a 
25 one-dimensional basis function, said method comprising the steps of: 
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transforming said image to the frequency domain using a two-dimensional 

transform; 

resampling the transformed image onto a quasi-polar map; 

transforming said basis function to the frequency domain using a one-dimensional 

5 transform; 

multiplying the resampled transformed image with the transform of said basis 
function along radial lines to create a first result; 

inverse transforming said first result to the space domain using a one-dimensional 
transform to create a second result; and 
10 finding one or more peaks of said second result, wherein the position of each of 

said peaks provides spatial parameters of one of said one or more embedded patterns. 

Yet another aspect of the present invention provides a method of determining 
transformations applied to a first image, said first image being a transformed copy of a second 
image, said second image having at least three patterns embedded therein, each pattern having 
15 predefined parameters and having been formed substantially from a one-dimensional basis 
function being substantially scale invariant as herein defined, said method comprising the 
steps of: 

calculating a projective transform of said first image; 

calculating a 1-D correlation between the projective transform and said basis 
20 function for a selection of angles; 

finding peaks of said correlation, each peak corresponding with one of said 
embedded patterns; and 

determining said transformations from the positions of said peaks and said 
parameters of said patterns embedded into said second image. 
25 Yet another aspect of the present invention provides a method of embedding a 

watermark into an image, said method comprising the steps of: 
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maintaining a basis function, said basis function being a substantially scale invariant 



forming one or more patterns from said basis function, each pattern having no 
variation in one direction; and 

adding said basis pattern(s) to said image. 

Other aspects of the invention are also disclosed. 



One or more embodiments of the present invention will now be described with 
reference to the drawings, in which: 

Fig. 1A shows a flow diagram of a method of detecting one or more patterns 
embedded in a two-dimensional function or signal; 

Fig. IB shows a flow diagram of a method, which is an implementation of the 
method shown in Fig. 1 A of detecting one or more patterns embedded in an image; 

Fig. 1 C shows a flow diagram of a method, which is another implementation of the 
method shown in Fig. 1 A; 

Fig. ID shows a flow diagram of a method which is yet another implementation of 
the method shown in Fig. 1 A, but which may be used for detecting a general one-dimensional 
pattern; 

Fig. 2 shows a flow diagram of a method of embedding one or more patterns in the 
two-dimensional function; 

Fig. 3 shows a schematic block diagram of an apparatus for performing the 
operations of the above methods; 

Fig. 4A illustrates an example pattern; 



one-dimensional function excluding the function 




constants; 



Brief Description of the Drawings 
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Fig. 4B illustrates the axis of symmetry of the pattern illustrated in Fig. 4A together 
with the orientation angle and perpendicular displacement from the centre of the pattern 
image; 

Fig. 4C illustrates a relationship between the vectors r and p; 
5 Fig. 4D illustrates another example pattern; 

Fig. 4E illustrates the basis function used to generate the pattern illustrated in Fig. 4D 
together with the orientation angle and perpendicular displacement from the centre of the 
pattern image of the axis of the pattern; 

Fig. 5 illustrates the operation of the projective transform; 
10 Fig. 6 illustrates the projection p£r) of the example pattern illustrated in Fig. 4A 

plotted in the quasi polar space; 

Fig. 7 illustrates a resulting correlation of the 1-D correlation between the projection 
p^f) illustrated in Fig. 6 and an associated basis function; 

Fig. 8A illustrates 3 correlation peaks detected in an image having 3 embedded 
15 patterns with different parameters; 

Fig. 8B illustrates the correlation peaks detected from a distorted version of the same 
image as that in used for Fig. 8A; 

Fig. 9 shows a comparison between a fractional power-phase function x( r ) a 
rescaled version ^(0.8r) thereof, as well as a phase mismatch between the functions; 
20 Fig. 10 shows a plot of how the magnitude of the cross-correlation of fractional 

power-phase functions x{ r ) 2 nd z(fi& r ) varies as a function of the fractional power y, and 

Fig. 1 1 shows a schematic block diagram of a system that may be used to implement 
the embedding and detection of watermarks in an image. 
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Detailed Description including Best Mode 

Where reference is made in any one or more of the accompanying drawings to steps 
and/or features, which have the same reference numerals, those steps and/or features have for 
the purposes of this description the same function(s) or operation(s), unless the contrary 
5 intention appears. 

Fig. 1A shows a flow diagram of a method 200 of detecting one or more patterns 
embedded in a two-dimensional function or signal, such as an image. Before describing the 
method 200 of detecting the patterns, a method 300 of embedding such patterns in the two- 
dimensional function will be described with reference to Fig. 2. 

10 Some portions of the description which follows are explicitly or implicitly presented 

in terms of algorithms and symbolic representations of operations on data within a computer 
memory. These algorithmic descriptions and representations are the means used by those 
skilled in the data processing arts to most effectively convey the substance of their work to 
others skilled in the art. An algorithm is here, and generally, conceived to be a self-consistent 

15 sequence of steps leading to a desired result. The steps are those requiring physical 
manipulations of physical quantities. Usually, though not necessarily, these quantities take 
the form of electrical or magnetic signals capable of being stored, transferred, combined, 
compared, and otherwise manipulated within registers and memories of a computer system, or 
similar electronic device. 

20 Fig. 3 shows a schematic block diagram of an apparatus in the form of a general- 

purpose computer system 100 for performing the operations of methods 200 and 300, wherein 
the steps of methods 200 and 300 may be implemented as software, such as one or more 
application program executing within the computer system 100. 

In particular, the steps of methods 200 and 300 are effected by instructions in the 

25 software that are carried out by the computer system 100. The software may be stored in a 
computer readable medium, including the storage devices described below, for example. The 
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software is loaded into the computer system 100 from the computer readable medium, and 
then executed by the computer system 100. The use of the software in the computer system 
100 effects an advantageous apparatus for detecting one or more patterns embedded in a two- 
dimensional function, and an advantageous apparatus for embedding such patterns in the two- 
5 dimensional function. 

The computer system 100 is formed by a computer module 101, input devices 
including a keyboard 102, a mouse 103 and an imaging device 122, and output devices 
including a printer 115 and a display device 1 14. The imaging device 122 may be a scanner 
or digital camera used for obtaining a digital image. A Modulator-Demodulator (Modem) 

10 transceiver device 1 16 is used by the computer module 101 for communicating to and from a 
communications network 120, for example connectable via a telephone line 121 or other 
functional medium. The modem 1 1 6 can be used to obtain access to the Internet, and other 
network systems, such as a Local Area Network (LAN) or a Wide Area Network (WAN), and 
may be incorporated into the computer module 101 in some implementations. 

15 The computer module 101 typically includes at least one processor unit 105, and a 

memory unit 106, for example formed from semiconductor random access memory (RAM) 
and read only memory (ROM). The module 101 also includes a number of input/output (I/O) 
interfaces including a video interface 107 that couples to the video display 114, an I/O 
interface 113 for the keyboard 102, mouse 103 and imaging device 122, and an interface 108 

20 for the modem 1 16 and printer 115. A storage device 109 is provided and typically includes a 
hard disk drive 1 10 and a floppy disk drive 111. A CD-ROM drive 1 12 is typically provided 
as a non-volatile source of data. The components 105 to 113 of the computer module 101, 
typically communicate via an interconnected bus 104 and in a manner which results in a 
conventional mode of operation of the computer system 100 known to those in the relevant 

25 art. 
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Typically, the application program is resident on the hard disk drive 110 and read 
and controlled in its execution by the processor 105. Intermediate storage of the program and 
any data fetched from the network 120 may be accomplished using the semiconductor 
memory 106, possibly in concert with the hard disk drive 110. In some instances, the 

5 application program may be supplied to the user encoded on a CD-ROM or floppy disk and 
read via the corresponding drive 1 12 or 1 1 1, or alternatively may be read by the user from the 
network 120 via the modem device 1 16. Still further, the software can also be loaded into the 
computer system 100 from other computer readable media. The term "computer readable 
medium" as used herein refers to any storage or transmission medium that participates in 

10 providing instructions and/or data to the computer system 100 for execution and/or 
processing. Examples of storage media include floppy disks, magnetic tape, CD-ROM, a 
hard disk drive, a ROM or integrated circuit, a magneto-optical disk, or a computer readable 
card such as a PCMCIA card and the like, whether or not such devices are internal or external 
of the computer module 101. Examples of transmission media include radio or infra-red 

15 transmission channels as well as a network connection to another computer or networked 
device, and the Internet or Intranets including e-mail transmissions and information recorded 
on Websites and the like. 

The methods 200 and 300 may alternatively be implemented in dedicated hardware 
such as one or more integrated circuits performing the functions or sub functions of methods 

20 200 and 300. Such dedicated hardware may include graphic processors, digital signal 
processors, or one or more microprocessors and associated memories. In one implementation, 
such integrated circuits are included in an imaging device, such as a camera or scanner. 

Referring again to Fig. 2, the method 300 of embedding patterns in the two- 
dimensional function starts in step 310 where a predetermined number of patterns are 

25 generated, each with predetermined parameters, and the separate patterns are summed 
together to form a pattern image. Such operations may be performed by the processor 105 
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under control of one or more application programs. The pattern image may be stored after 
generation in the memory 106 or HDD 110. 

Each pattern has no variation in one direction only, that direction being perpendicular 
to an orientation angle f3 n with the horizontal direction. The pattern also has an axis, which 
5 may be an axis of symmetry or a nominated axis, which is parralel to the no-variation 
direction, and at a displacement v n from the centre of the pattern image. 

The patterns are typically generated from a one-dimensional basis function applied at 
the direction of variation and repeated parallel to the axis. 

In a preferred implementation, the basis function is a complex homogeneous function 
10 of the form: 

g(r) = \rf a =\ r \" cxp(ia\og^ (1) 

where r is a one-dimensional coordinate, which is not necessarily positive, while a 
and p are constants. The basis function g(r) is preferably attenuated in areas where the basis 
function has a frequency above the Nyquist frequency of the pattern image. Further, the 
15 pattern generated from basis function g{r) has an axis of symmetry, which may be made odd 
or even symmetry. Equation (1) may be considered as an amplitude function, amplitude 
modulating a phase function, with the phase function having a logarithmic phase. When such 
a complex homogeneous function g(r) is scaled, say by a factor a, the scaling only introduces 
a complex constant factor as follows: 

20 g(ar)=a p+ia g{r) (2) 

The advantage of the complex homogeneous function is that the auto-correlation of 
the complex homogeneous function is directly proportional to the cross-correlation of the 
complex homogeneous function with a scaled version of the complex homogeneous function. 
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This 'scale-invariant' property allows a watermark to be detected in an image even after a 
scale transformation by correlating the image with the basis function. 

In practice, a basis function useable as a scale invariant function only has to 
approximate this ideal property sufficiently so as to provide a dominant correlation peak. For 
example, a fractional power-phase function defined by: 



.cc 



z (r) = r p exp i—r 



(3) 



has a phase derivative (i.e. instantaneous frequency) that is similar to that of the ideal 
homogeneous function for small fractional powers y in the range|^|<l, y*0. The phase 

derivative of the fractional power-phase function %(r) is: 

^#3 = arH (4) 
dr 

while the phase derivative of the ideal homogeneous function g(r) is: 

<*arg[g(r)] = ar „ (5) 
dr 

Fig. 9 shows the real part of the fractional power-phase function %(r) in the range 
(0.5, 1) as a continuous curve, and a version of the fractional power-phase function %(r) 
rescaled by a factor of 0.8 as a dotted curve. The fractional power-phase function %(r) has 
parameters p=0, a=606n, and y=0A. The phase mismatch between the functions is also 
shown. The cross-correlation between the two functions illustrated will thus not be an ideal 
delta function, but rather some spread of energy around a point. 

Fig. 10 shows a plot of how the magnitude of the cross-correlation of fractional 
power-phase functions %(r) and j(0.8r) varies as a function of the fractional power y It can 
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be observed from the plot that the correlation drops from about 70% at y = 0.1 to near 0% at 
y = 0.2. 

Generally, a measure of a function's scale invariance is equivalent to how well the 
phase profile of the function matches that of the original after scaling. 

The ideal cross-correlation or overlap integral for a nominally orthogonal 1-D 
function with inverse square root weighting, as is the case for the function in Equation (3) 
with p=-0.5, is as follows: 



functions the phase mismatch function y/{x) = 0 , and the correlation is not attenuated. For 
small phase errors, that is the phase mismatch function \y/{x\ < 1 , the cross-correlation energy 
is as follows: 




(6) 



wherein y/{r) is a phase mismatch function. For two perfect phase matched 




2 



(7) 



Ignoring third and higher powers, and defining weighted phase statistics as: 




(8) 



then Equation (7) may be simplified to: 




(9) 



Ignoring the last term, this may be expressed as: 
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wherein a is the weighted root mean square phase mismatch. It is noted that other 
weighting, that is other than l/yfr , may be applied to the phase statistics in Equation (8) if the 
need arises. 

Preferably, the phase profiles of the unsealed and the scaled basis function match 
(apart from a constant or linear term) within a RMS of 0.7 radian. In such a case, the cross- 
correlation will not be attenuated by more than 30% when compared with the ideal. Table 1 
sets out the magnitude of the correlation for different values of the RMS phase mismatch a . 



Table 1 



RMS phase 


Correlation 


mismatch a 


|c 0 |« Vl-cr 2 


0 


1.00 


0.1 


0.99 


0.2 


0.98 


0.3 


0.95 


0.4 


0.92 


0.5 


0.87 


0.6 


0.80 


0.7 


o.7i 


0.8 


0.60 


0.9 


0.44 



Other basis functions include: 

<7, (r) = [sgn(r)]* exp(i(or' p + s)); 

Qiir)= ^ n qJ\v\ n r\, and 

n=N, 
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? 3 (r)=[sgn(r)]Vexp 



i — r 



(13) 



\r ) 

where s, p, N\ 9 N 2 , a, k and ju are constants and the function q 0 is an arbitrary 

function. The constant is preferably in the range 0.5 to 2, but excluding 1 . The constant 
k=0 or 1 . In the case where k=0, the function q 3 (r) has even symmetry, whereas in the case 
5 where k=\ , the function q 3 (r) has odd symmetry. 

Fig. 4A illustrates an example pattern generated using a real part of the basis function 
of Equation (1), with masking applied at the area adjoining the axis of symmetry to remove 
values with a frequency above the Nyquist frequency of the pattern image. In illustrating the 
example pattern, pattern values have been mapped to values in the range of [0, 255], with a 
10 value of 0 being represented as the colour black and a value of 255 being represented as the 
colour white. Fig. 4B illustrates the axis of symmetry of the pattern illustrated in Fig. 4A 
together with the orientation angle f3 n and perpendicular displacement v n from the centre of 
the pattern image. 

Referring again to Fig. 2, in order to embed patterns that are imperceptible to a 

15 human observer, the pattern image is retrieved from the memory 106 and is perceptually 

masked in step 320 by the processor 105 in order to greatly reduce the levels of the patterns 

corresponding to regions of the image having low intensity variation, and reduce by a lesser 

amount the levels of the patterns corresponding to regions of the image having high intensity 

variarion. An example measure of intensity variation is the local gradient magnitude of the 

20 luminance in the image. Other measures include second partial derivatives of the luminance; 

local estimates of the "energy" or frequency content, local variance, and more sophisticated 

estimates of human visual system masking. 

The perceptually masked pattern image, which may be called a watermark, is added 

to the two-dimensional function, such as an image, in step 330. If the image is a colour 

25 image, then the watermark is preferably added to the luminance part of a colour image. This 
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allows the watermark to survive when the watermarked image is converted from colour to a 
greyscale representation. Alternatively, the watermark may be added to one or more of the R, 
G, B, H, V, S, u, v etc channels of the colour image, or any combination thereof. Apart from 
simple algebraic addition, addition of the watermark to the image also includes dithering and 

5 half-toning. The real and imaginary parts of a complex basis function may be added 
independently to two or more channels of the colour image. Watermarks may also be applied 
to sequences of images such as video streams by sequential addition. 

The image with the embedded watermark may be stored on storage device 109 (Fig. 
3) in a digital image format, such as tiff, gif, jpeg, mpeg etc. The watermarked image may 

10 also be printed on printer 115 (Fig. 3) to create a hardcopy of the watermarked image, or may 
be communicated to another computer module 101 using the network 120. 

Referring again to Fig. 1 A where the flow diagram of the method 200 of detecting 
one or more patterns embedded in an image is shown, with the patterns embedded in the 
image using the method 300 (Fig. 2) described above. Method 200 starts in step 202 where a 

15 digital copy of the image is obtained. Typically the image may already be in a digital image 
format and stored on storage device 109 (Fig. 3). In such cases the image is converted to an 
array of pixel values. If the image is still in a hardcopy format, such as a photograph, then the 
hardcopy image is first converted into a digital format by scanning the hardcopy using the 
imaging device 122 (Fig. 3). 

20 Step 205 follows where the processor 105 undoes the perceptual masking by first 

forming a perceptual mask from the image, and then emphasising the image with the 
perceptual mask by dividing the values of the image by the corresponding values of the 
perceptual mask. 

A projective transform is then applied to the resulting image in step 210. The 
25 projective transform accumulates energy by summing values along straight lines in the image. 
This is similar to the way that the total effective path of X-rays accrues during transmission 
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through a patient's body. Fig. 5 illustrates the operation of the projective transform in that a 
projection of the values of a function h(x 9 y) is accumulated along straight lines with an angle 
0 with the vertical direction to form a projection p£r). The projection p^r) is found for a 

selection of projection angles 6'm the range (-k,7v], or possibly (~ 

5 The Radon (or equivalently Hough) transform is one such projective transform that 

may be used in step 210 and is defined as: 

+O0+CO 

R d {h(x,y)} = p e {r)= j \h{x,y)5{r - xcos0 - ysin0)dxdy (14) 

In order to derive a convenient implementation of the Radon transform for a discrete 
dataset, a useful correspondence between the projection p^r) of 2-D function h(x,y) and the 
10 slices of the function's Fourier transform is used, that correspondence being known as the 
"projection-slice theorem". 

The projection-slice theorem states that the one-dimensional Fourier transform of a 
projection of a two dimensional function is equal to a radial slice of the two-dimensional 
Fourier transform of that function. Hence: 



15 



H(u,v)= J ^h(x,y)exp[-2m(ux + vy)}ixdy (15) 

-co- CO 

wherein H(u,v) is the 2-D Fourier transform of function h{x,y). In the quasi polar 
space the angles are in the range (-;r/2,;r/2], while distance is in the range (-00,00). By 
defining quasi-polar coordinates (q,<f>) in the Fourier domain, the coordinate transform is 

w = <?cos0, v = <7sin^, and one form of the projection-slice theorem is obtained for the 
20 Fourier polar angle corresponding to the Radon projection angle (f> = 6 as follows: 

-KO 

\p 0 {r)exp[-2m(rq)\ir = H {q cos 0, q sm 6) = P e {q) (16) 

-co 
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Equation (16) is useful because it allows estimation of (the Fourier transform of) a 
Radon projection as a radial slice of the 2-D-FFT of a discrete image. This suggests that a 
discrete Radon transform may be evaluated by first performing a 2-D FFT followed by a 
Cartesian to polar remapping, using a suitable interpolation - such as bicubic, or chirp-z - to 
5 perform the resampling. 

A two-dimensional function h(x,y) formed from a one-dimensional basis function 
g(r) (e.g. Equation (1)) having an orientation angle fi n and perpendicular displacement v n may 
be written as: 

h(x,y)= g{xcosfi n +ysinp n -v n ) (17) 

10 The two-dimensional function h(x>y) formed from the complex homogeneous 

function of Equation (1) may be written in an even form h* and an odd form h~ as follows: 

*;*(*.jO=|*cosA +ysmp n -v„r' a (18) 

Kap{ x >y)= s ^ xcos Pn +rsinA, -v n \xoosl3 n + y sin fi n - v n \ P+ ' a (19) 

15 Linear combinations of the functions in Equations (18) and (19) may also be used. 

Note that a one-sided function is obtained from adding an odd and even function in equal 
proportion. 

When the projection transform is applied to a pattern h(x,y) having a variation in only 
one direction with that direction being at an orientation angle f5 n with the horizontal direction, 
20 then the values of the projection p^r) are significantly higher when the angle 6 is equal to the 
orientation angle /?„ compared to all other values of angle ft Fig. 6 illustrates the projection 
p^r) of the example pattern illustrated in Fig. 4A plotted in the quasi-polar space. It can be 
seen that the values of the projection p^r) are significantly higher when the angle 0is equal to 
the angle /?„. The projection p£r) also has even or odd symmetry about the offset distance v n . 
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Hence, by applying the Radon transforms on the functions in Equations (17), (18) 
and (19), it can be shown that the Radon transform of such a function is constrained to a line 
where 6 ~ J3 as shown in the following: 

p(r,0) = R e {h(x,y)} = g{r - v n )s{6 - fi m ) (20) 
p+ (r, 0) = R B {h; ap (x, y)}=\r-v n f° 8(9 - fi m ) (21) 

p~ (r,0) = R e {h- pap (x, y)} = sgn(r - v 0 )r-v n \ p+ia S(0 - /3 n ) (22) 

Having concentrated a two-dimensional function h{x,y) formed from an one- 
dimensional basis function g(r) onto a single line in quasi-polar space by the use of Radon 
transform, it is further possible to concentrate the function's energy into a single point (or into 
a small region near a point) by using 1-D correlation detection. If the two-dimensional 
function h(x,y) is formed from an one-dimensional basis function g(r) having scale invariant 
properties as herein defined, then the correlation detection will be scale invariant. If the 
watermark includes several basis patterns at different (and in general unknown) angles /?„ , 
then the basis patterns may be detected by repeating 1-D quasi-radial correlations (in 
coordinate r) for all values of the polar angle 6 . As with other correlations and convolutions, 
a series of 1-D correlations may be efficiently implemented by use of discrete FFT 
algorithms, specifically using 1-D FFTs of 2-D functions along the radial axis. 

In the case of Equation (21), the 1-D correlation (symbol <8> ) of a 2-D function using 
the dummy radial coordinate r| is as follows: 

c + (r^)= + J^-r| p -'V + (/7,^-H'' +,a ®p + M) (23) 

The correlation function c + (r,0) is a highly concentrated function of both r and 6 . 
With the preferred parameter p = -1/2 , the correlation function c + (r,#) has good 
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orthogonality properties. For parameter p « -1 the correlation function c*(r,6) resembles a 
near perfect delta spike function 8 (r - v n ]s(& - J3 a ) . A near perfect delta-like result may also 
be advantageously achieved by the use of phase-only-correlation. 

Similarly for the odd form h~ (Equation (19)), the 1-D correlation is: 

5 a' (r,9) = 7sgn( 7 - rfr - r\ p - ia p (rj^r) = (sgn(r)H' +fa )<8> p (r,0) (24) 

-00 

The more general correlation of the pattern h(x,y) expressed in Equation (17) is: 

+00 

c{r,9)= jg'irj-rjpirj^rj^gir^pir^) (25) 

-co 

where the asterisk denotes complex conjugation. 

Step 220 follows where the processor 105 performs a 1-D correlation between the 
10 projection p^r) and the basis function in the quasi-radial coordinate r for all possible values 
of the polar angle 6 . If the basis function is a complex function, such as that in Equation (1), 
then the complex basis function is preferably correlated with the (real) projection p^r). The 
term "correlation" also includes phase correlation and phase correlation scaled by energy. 
The resulting correlation has a peak at quasi-polar coordinates (v n ,Pn) for each embedded 
15 pattern. The result of the 1-D correlation between the projection p£f) illustrated in Fig. 6 and 
the basis function of Equation (1) is illustrated in Fig. 7. 

Finally, in step 230 the processor 105 finds the absolute peaks of the correlation. 
The orientation angle J3 n and perpendicular displacement v n of each embedded pattern is 
directly available from the quasi-polar coordinates {v n ,Pn) of the peaks. 
20 Fig. IB shows a flow diagram of a method 400, which is an implementation of the 

method 200 (Fig. 1A) of detecting one or more patterns embedded in an image. In this 
implementation, the Radon transform is used as the projective transform, and is implemented 
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by resampling in the frequency domain. The correlation with the basis function (step 220, 
Fig. 1 A) is also performed by multiplication in the frequency domain. 

Method 400 starts by the processor 105 performing steps 202 and 205, which are the 
same as those of method 200 described above. 

5 The processor 105 then computes in step 211 the 2-dimensional Fast Fourier 

transform (FFT) of the image. Preferably, before computing the FFT, the image values 
(intensities) near the image edges are first attenuated by the processor 105 so that the image 
values fade to zero gradually and smoothly towards the edges of the image. Those skilled in 
the art of image processing are well aware of such preprocessing steps. 

10 Step 212 follows where the values of the transform, which are in the Cartesian 

frequency space, are resampled by the processor 105 onto the quasi-polar Frequency space. It 
is noted that the angular spacing and the radial scaling need not be constant. A 1 -dimensional 
inverse Fast Fourier transform (IFFT) in the radial direction of the remapped transform of the 
image is calculated by the processor 105 in step 213 for all desired angle values. This 

15 provides the Radon transform of the image, which is equivalent to the output of step 210 of 
method 200 (Fig. 1A). 

Method 400 continues in step 214 where a 1 -dimensional FFT in the radial direction 
of the Radon transform is calculated by the processor 105 for all desired angle values. In 
addition, in step 215, the 1 -dimensional FFT of the (one-dimensional) basis function is 

20 calculated by the processor 105. Alternatively, the basis function may be mathematically 
transformed. The processor 105 next multiplies the transform of the basis function in step 
216 with values of the output of step 214 along constant radial lines for all angle values. The 
IFFT in the radial direction of the output of step 216 is then calculated by the processor 105 in 
step 217. Steps 214 to 217 are the frequency space equivalent of step 220 in method 200 (Fig. 

25 1A), and the output of step 217 is the same as that of step 220. 
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Finally, the processor 105 finds the absolute peaks of the correlation in step 230 in 
the manner described in relation to Fig. 1 A. 

Fig. 1C shows a flow diagram of a method 500, which is another implementation of 
the method 200 (Fig. 1 A) of detecting one or more patterns embedded in an image. The steps 
5 of method 500 are the same as those of method 400 (Fig. IB) except, because steps 213 and 
214 effectively cancel each other out, those steps are not performed in method 500. 
Accordingly, the steps of method 500 have descriptions corresponding to those described 
above. 

If a pattern is embedded a substantial distance from the centre of an image, or if an 

10 image is cropped substantially so that the embedded pattern is significantly decentred, then 
the correlation peak may suffer noticeable degradation and asymmetry. Substantial 
degradation can prevent successful detection of an underlying peak. To counteract the 
reductions in peak detectability it can be advantageous to use the Hilbert transform detection 
technique. For 1-D scale invariant patterns the Hilbert transform relation is particularly 

15 simple owing to an unusual self-Hilbert transform relationship. In practice the Hilbert 
technique simplifies in this case to correlating the bilateral (i.e two-sided) real embedded 
pattern with two separate complex, and one-sided detection templates. In practice two 
complex correlations are carried out with one-sided complex templates that are mirror 
reflections of each other. The square correlation magnitudes are then added together to give 

20 the overall correlation magnitude squared. The one-side correlations have straightforward 
interpretations in the Fourier domain (as Hilbert transforms) which allows efficient 
implementation by direct use of the Hilbert transforms. 

Methods 200, 400 and 500 will detect patterns even if the watermarked image is 
rotated. In order for the detection to detect the patterns after scaling, anamorphic scaling and 

25 shear, patterns with scale invariant correlation properties, as described hereinbefore, have to 
be used. 
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However, true scale invariance is an ideal, and is only achievable (within a complex 
constant factor) for complex functions of infinite extent in the spatial direction. Typically, an 
embedded watermark is realised as a real function of limited extent. The limit in spatial 
extent constrains the scale invariance to be effective within some range, typically from 25% to 
5 400%. The limit to a real function is more serious in that exact scale invariance, strictly 
speaking, is lost. Consider a real function based on the real part of Equation (1): 

g(x) = m\x\ p+ia \= \x\ p cos(a ln[jc]) (26) 

A rescaling by a factor s provides: 

II ) (27) 

= {cos(a ln[x])cos(a ln[s])- sin(a ln[x])sin(<z ln[s])} 

10 Clearly this is not a simple complex factor as shown in Equation (2). A new 

component has been introduced. The new sin(a ln[jc]) component is known as the quadrature 
function of thecos(a ln[;c]) function. There is an elegant manner for regaining the overall 
scale invariance in the detection methods 200, 400 and 500, and that is to use a fully complex 
matching function in the correlation procedure. The general manner can be represented as 

15 follows: 

c{r,0)= jg'(TJ-r)fi{p(v,0)}dT7 = g(r)®W{p(r,e)} (28) 

—CO 

Assuming a general scale invariant spread-spectrum/spread space function g, it can 
be proved that the correlation of a complex matching function with a real embedded function 
gives essentially the same correlation peak as a complex-complex correlation, but reduced by 
20 a factor of two. Hence, 

WxWrM' W**.'! (29) 
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There is an insubstantial difference in the detection peak shape, with the background 
difference being orders of magnitude below the peak level. Note that maintenance of the 
line's overall energy (via the Parseval / Plancherel theorem) is important for consistent peak 
values between lines with different values of the orientation angle 6, The energy is 
5 intrinsically maintained for conventional correlation; however, phase correlation requires 
specific attention to maintain the correct energy for each line. 

Referring again to Fig. 2 where the method 300 of embedding a one-dimensional 
pattern is shown, an alternative one-dimensional basis function that may be used to generate 
the pattern(s) in step 310 is a spread spectrum function of the form: 

10 g'(r) = random(seed , r) (30) 

where r is a one dimensional coordinate, while seed is the seed number of a suitably 
chosen pseudo random number generator. A nominated axis of the pattern generated from the 
basis function g'{f) coincides with the origin of the pseudo random sequence generated from 
the basis function g'(r). 

15 Fig. 4D illustrates an example pattern generated using the basis function of Equation 

(30). In illustrating the example pattern, pattern values has been mapped to values in the 
range of [0, 255], with a value of 0 being represented as the colour white and a value of 255 
being represented as the colour black. Fig. 4E illustrates the basis function g'{r) used to 
generate the pattern illustrated in Fig. 4D together with the orientation angle (5 n and 

20 perpendicular displacement v n (from the centre of the pattern image) of the axis. 

Yet another spread spectrum function g"that may be used to generate the pattern(s) 
in step 310 (Fig. 2) is the function of the form: 

g"(r) = F- 1 {const. *Mw random )} (32) 
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having a constant magnitude and pseudo random phase uniformly distributed in the 
range -n to + n . 

Other possible spread spectrum functions are M-sequences, spectrally weighted 
pseudo-noise sequences such as red noise; and sequences derived from chaotic or stochastic 
5 systems. 

However, the functions shown in Equations (30) and (31) are not scale-invariant. 
Accordingly, when any one of the functions shown in Equations (30) and (31) is correlated 
with a scaled version of itself, a dominant correlation peak will generally not result. 

As methods 400 and 500 shown in Figs. IB and 1C respectively may only be used to 

10 detect one or more scale-invariant patterns embedded in an image, Fig. ID shows a flow 
diagram of a method 450, which is another implementation of the method 200 (Fig. 1 A), but 
which may be used for detecting a general one-dimensional pattern, such as those formed 
from Equations (30) and (32), embedded in an image. Again the Radon transform is used as 
the projective transform, and is implemented by resampling in the frequency domain. 

15 Accordingly, steps 202 to 213 are the same as those described in relation to method 400 (Fig. 
IB). 

After step 213 method 450 continues in step 221 where the magnitude of the 1- 
dimensional FFT in the radial direction of the Radon transform is calculated by the processor 
105 for all desired angle values. Also, in step 222, the magnitude of the 1 -dimensional FFT 
20 of the (one-dimensional) basis function is calculated by the processor 105. 

The processor 105 next, in step 223, resamples the magnitudes of the Fourier 
transforms resulting from steps 221 and 222 onto a logarithmic sampling space. If the basis 
function g(r) has N samples, then the logarithmic sampling space used in step 223 is related to 
the frequency space of the FFT through 

25 /, = e x <> (33) 
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where f { denotes the radial frequency, i ranges from -N/2+l to N/2 and X = — In — . 

The results of the resampling are then, in step 224, correlated (using one-dimensional 
correlation) along constant radial lines for all angle values. In the result of the correlation of 
step 224, the processor 105 then, in step 225, finds the absolute peak of values, if a peak 

5 exists, along each constant radial line. If a peak exists in a constant radial line corresponding 
to a specific angle value, then the location of this peak is used to determine, in step 226, a 
scale factor k that relates the basis function g{r) to the pattern that exists in the image at that 
specific angle. The scale factor k is used in step 227 to scale the one-dimensional basis 
function g(r) to substantially the same scale as the pattern in the radial line being considered. 

10 In particular, if the peak is located at sample £ , then the scale factor k is given by: 

K = e« (34) 

The scaling may be achieved using cubic interpolation or other interpolation schemes 
such as Fourier-based interpolation. 

Method 450 continues in step 228 where the scaled basis function is correlated with 
15 the Radon transform of the image, which is the result from step 213, along the constant radial 
line being considered. Finally in step 229 the processor 105 finds the absolute peaks of the 
correlation in step 228. The location of this peak determines the perpendicular displacement 
v„ of each embedded pattern at orientation angle f3 n . 

As set out above, each pattern included in the embedded watermark has two spatial 
20 parameters, those being the perpendicular displacement v n and the orientation angle f3 n . 
Distortions result in a change in one or both the spatial parameters of the patterns. Fig. 8A 
shows 3 correlation peaks detected in an image having 3 embedded patterns with different 
parameters. Fig. 8B shows the correlation peaks detected from a distorted version of the same 
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image. As can be seen when the positions of the peaks in Figs. 8A and 8B are compared, the 
spatial parameters of the patterns have changed due to the distortions. 

Rotation, scaling, anamorphic scaling and shear are affine transformations in that 
they preserve straight lines along with the parallelism of lines, but not lengths and angles. 
Such distortions may be represented by a 2x2 matrix operating upon a coordinate vector 
(x,y) to give a coordinate (x\y') as follows: 







fa 




n 

\y ) 










<C dj 





x' = Ax 



Rotation may be written in the matrix form of Equation (35) as follows: 



cos/? sin/?Y x^ 
- sin ft cos P )\yj 



ft 






V 



(35) 



(36) 



Similarly, scaling may be written as: 



fx'^ 




(a 0^ 


( X ) 


<y'j 




,0 aj 





(37) 



Anamorphic scaling along the x and y axes is sometimes called aspect ratio change 
and has the form: 



\y J 



a x 0 
0 a 



y J 



(38) 



The scaling may be along any two directions, with those directions not necessarily 
being orthogonal. Shear distortion must therefore have the shear direction specified. In the 
case of shear in the x direction, the distortion has the form: 



o \\y) 



(39) 



and in the y direction: 
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y) 



1 OY; 



A>7 



(40) 



By using four or more patterns in the watermark, with the patterns having suitable 
spatial parameters, the total distortion may be estimated. The total distortion may also be 
decomposed into convenient combinations of the prime distortions, namely rotation, scaling, 
anamorphic scaling and shear. Consider for example a general distortion followed by 
translation, which may be expressed as: 



J 



= s 



K°2l 



'22 J 



Y JC> 



fx } 



(41) 



wherein s is a scaling factor, and (x 0 ,y 0 ) T is a translation. When the scaling factors 
is negative, the distortion is rotation by 180°. Furthermore 



det 



\. a 2\ a 21j 



= -1 



(42) 



indicates that a reflection has occurred. Consider a point (xy) on the axis before 
distortion, the point (x,y) being a distance /L n along the axis. The canonical representation of 



the point (x,y) is: 



\x = v n cos/?„ -X n sin^„ 
[y = v„ sin/?„ + X n cos/3 n 



(43) 



Representing the point (xy) as a vector r, and defining a vector p as the unit vector 
perpendicular to the axis as follows: 



Ir = *i+ vj 
jp=cos#,i + sin/? n j 

then the dot product of vectors r and p is equal to the displacement v„ as follows: 



(44) 



r«p = v n =xcos/3 n + vsin/?„ 



(45) 
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Fig. 4C illustrates the relationship between the vectors r and p 5 as well as the 
distances v„ and X„. Defining cos/?„ = c„, and sin/?„ = s n , and substituting Equation (43) into 
Equation (41) provides: 



(46) 



ix' = sa u (v„c„ -X m s m )+sa n {y m s u + A,c H )+x 0 
\y' = sa 2l (v n c„ -A„s n ) + sa 22 (v„s n +X n c n ) + y 0 

5 Writing Equation (46) in a different form provides: 

x ' - x o- v n{ a u c n +a nS„)s = x = y-^o -v«(a 2 iC, + a 22^> (4?) 
{-a u s„+a l2 c a )s " {-a 2l s n +a 22 c n )s 

{*' -XoX- a 2i\ +a 22 c„)-(y' -y 0 )(-a u s n +a l2 c n ) 

= sv n (a u c n +a u s n X~a 2] s n +a 22 c n )- (48) 

Similar to the above, vector f is the vector to point (x',y') y that is point (x,y) after 
the distortion, unit vector p is perpendicular to the axis of the distorted pattern and forming 
10 an angle /3 n with the horizontal axis, and point (x',y r ) being a distance X n along the axis. 
The canonical representation of the point {x',y') is: 



x =v n cosp n ~X n sm/3 n 
y =v n sin/?„ + i„ cos/?„ 



(49) 



Also, the dot product of vectors f and p is equal to the displacement v n after 
distortion as follows: 

15 f • p = v„ = x' cos f3 n + y sin f} n (50) 

Combining Equations (46) and (50) provides: 

tanA fe„«, -«,,*.) (51) 
{-a 2l s„ +a 22 c n ) 
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- =y 5 feng. +^!2^X- g 21^ +fl22<0-(fl2lC„ ^^22^X- fl ll^ + fl 12 C n ) 

V( a n c * +^i2^) 2 +fcn^ + fl .2^) 2 (52) 

V( a n C « + a M S nY +( a n C n + a i2 S nY 

The parameters p n and v„ are available from the position of the peak for each 

pattern detected using method 200 (or equivalently method 400, 450, or 500). Equations (51) 
and (52) include 7 unknowns. Therefore, by embedding and detecting at least 4 patterns with 
5 different parameters, and solving Equation (51) and (52), the distortion parameters s, a xu a i2 , 
tf2i, #22, *o and yo can be found. With the distortion parameters s 9 a\\, a\ 2 , a 2 \, a 22 , *o and y 0 
known, the distortions may be reversed to form the undistorted image. 

Typically, both the perpendicular displacement v n and the orientation angle fi n are 

chosen to be widely separated in the embedded patterns to give the best accuracy. Use of 
10 more than 3 patterns allows noise insensitive redundancy in the arrangement of patterns. 

In general, more than 3 patterns are required if reflection is also to be detected in 
addition to rotation, scaling, anamorphic scaling and shear. Once the distortion parameters 
are known, the original parameters, that is the perpendicular displacement v n and the 
orientation angle (5 n of each pattern, may be determined. 
15 Using method 300 for embedding a watermarking and method 200, 400 or 500 to 

detect the embedded watermark, data or a pointer to that data (for example using a URL or 
pointer to that URL) may be embedded into an image. The so-called image metadata, that is 
data about, or referring to some property of that image, is thus bound to the image in that it 
can be retrieved from the image even if the image is distorted. 
20 The metadata may include a database address, or related data. This allows images to 

be uniquely associated with a database address, even if the image is processed and any 
datafile heading removed through malicious intent or mere format conversion. Similarly, the 
metadata may include copyright information of the image. 
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The distortions that the metadata can resist include the projective transforms above, • 
but also include the following: 
Printing; 

Photocopying/copying; 
5 Scanning; 

Colour removal; 

Gamma correction/gamma change; 
JPEG compression/general compression; 
Format conversion (ie BMP to GIF); 
10 Noise addition and removal; 

Filtering, such as low-pass filtering; and 
Cropping. 

The manner in which the metadata may be encoded into the image includes: 

- Encoding the metadata in an arrangement of embedded patterns, that is, using the 
15 paramenters p n and v n of embedded patterns. The decoding, which follows detection, then 

consists in recognising the arrangement irrespective of projective transformations. 

- Encoding the metadata using an alternative watermarking scheme. Additional 
patterns are embedded using method 300, which are then detected using the method 200, 400 
or 500. Using the additional patterns, the image distortion is then determined, followed by 

20 reversing the distortion. The watermarks used to encode the metadata may then be detected 
and decoded in the usual manner. Alternatively, the transformation, once determined, may be 
used to define a mapped detection of the alternative watermark, followed by a mapped 
decoding, if required. This approach avoids direct image inversion and the associated 
computation of image warping/interpolation. 

25 In some situations it is not necessary that the embedded patterns be imperceptible to 

the human eye. Nor may it be necessary that the object undergoing embedding be an image. 
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For example a planar object, such as a silicon wafer, may have a number of patterns 
embedded, using engraving or etching, in its surface. The patterns may then be utilised to 
determine the surface's position and orientation. 

Fig. 1 1 shows a schematic block diagram of a system 600 that may be used to 
implement the embedding and detection of watermarks in an image. The system 600 includes 
a first apparatus 610 for receiving a first image 601, and for embedding a watermark into the 
image 601 to produce a second image 602. 

The first apparatus 610 may be the general-purpose computer system 100 described 
in relation to Fig. 3 wherein the first image 601 is captured using imaging device 122, or 
received from the network 120. In this case, the second image 602 may be printed on printer 
115 or directly communicated to another apparatus, such as a second apparatus 620, over the 
network 120. 

In alternative implementation the method 300 is performed directly in a scanner or a 
camera, where the first image 601 is captured using an imaging device of the scanner or 
camera, and the second image 602 is in electronic form. 

The system 600 further includes a second apparatus 620 for receiving a third image 
62 1 that may contain an embedded watermarks, and for detecting the watermark, if such a 
watermark is present. The third image 621 may be the same as the second image 602, or may 
be a distorted version of the second image. 

The second apparatus 620 may also be the general-purpose computer system 100 
described in relation to Fig. 3 wherein the third image 621 is captured using imaging device 
122, or received from the network 120. 

The second apparatus 620 is operative to then detect the watermark, if a watermark 
exists, and report such to a user. The second apparatus 620 may additionally decode any 
information that may be embedded in the placement of the respective patterns making up the 
watermark. In a case where the second apparatus 620 expect to find an image with embedded 
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patterns having predetermined parameters, the second apparatus 620 may detect that the third 
image 621 is a distorted version of the second image, and such distortions may be 
automatically reversed to produce a fourth image 622. 

In an alternative implementations one of methods 200, 400 or 500 is performed 
directly in a scanner, where the third image 621 is captured using the scanner. The scanner 
may then alert the user, automatically reverse distortions to produce the fourth image 622, or 
decode information encoded in the parameters of the embedded patterns. 

The foregoing describes only some embodiments of the present invention, and 
modifications and/or changes can be made thereto without departing from the scope and spirit 
of the invention, the embodiments being illustrative and not restrictive. 
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